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Not long ago, it seemed inconceivable rhrt c 
proteins could be designed from scratch. Be- 
cause each protein .sequence has an astro- 
nomical number of perennial conformations, 
it appeared that only hp experimentalist with 
the evolutionary life span of Mother Nature 
could design a sequence capable of folding 
into a single, well-defined three-dimensional 
.sr.mccure. But now, on page 82 of this issue, 
Dahiyat and Mayo (J ) describe 
a new approach that makers de 
novo protein design as cany as 
running h computer program. 
Well almost. . . 

Tlie intellectual roots of this 
new work go back to the early 
1980s when protein engineers 
first thought about designing 
proteins (2), At that point, the 
prediction of a protein's three- 
dimensional structure from its 
sequence alone seemed a diffi- 
cult proposition. However, diey 
opined that the inverse prob- 
lem — designing an amino acid 
sequence capable of assuming a 
desired three -dimensional struc- 
ture — would be 3 more rmcmblc 
problem, because one could 
"over-engineer" the system ro "fa- 
vor the de-si reel folding pattern. 
Thus, the problem of dc novo protein design 
reduced to two steps: selecting a desired ter- 
tiary structure and finding a sequence that, 
would stabilize this fold. Dahiyat and Mayo 
have now mastered the second step with spec- 
tacular success. They have distilled the rules, 
insights, and paradigms gleaned from two de- 
cades of experiments (3) into a single compu- 
tational algorithm rhat predicts an optimal 
sequence for a given fold. Further, when put to 
the test the algorithm actually predicted a 
sequence that folded into the desired three- 
dimensional structure. Thus, the rules of pro- 
tein biding and computational methods for 
dc novo design may now he sufficiently de- 
fined to allow the engineering of a variety of 
proteins. 

Dahiyat and Mayn's program divides the 
interactions that stabilize protein structures 



Tlx* author is in the Departmfint of Riorhcilifelry and 
Biophysics, University ol Pennsylvania SchonJ of Medi- 
cine. WiLiUttlphia. PA 19104-6059, ijka. E-maiT 
wHnq ro do<Sma ILmacf . upenn. odu 



into three categories: internet ions of side 
chains that arc exposed to solvent, of side 
chains buried in the protein interior, and of 
parts of the protein that occupy more interfa- 
cial positions. Exposed residues contribute to 
stability, primarily through conformational 
preferences and weakly attractive, solvent- 
exposed polar interactions (4). The burial of 
hydrophobic re-si dues in the well-packed in 




Better than the real thing. The nalural zino finger protein 2|f268 (left) is 
stabilized In part by a cor© of hydrophobic (green) side chains and meiwf- 
chalating fcidrt chains (red). In lh« designed protein FSD-1 (right), the 
7M£w eon* is retained but the metsl-chelating His residues and one of the 
Cys residues of Zif268 are converted to hydrophobic Pho and Ata rosi- 
dues, thereby extending the hydrophobic core. ThQ fourth metal llgand 
Cys 8 is converted to a Lys residue. The apolar portion of (his Interfadal 
residue shields the hydrophobic core, whereas its ammonium group iy ex- 
posed to solvent. The helix is also stabilized by ah N-capping interaction 
( 79), which presumably also stabilizes ihy structure. 



i:erior of a protein provides an even more 
powerful driving force for folding. The side 
chains in the interior of a protein adopt 
unique conformations, r.he prediction of 
which is a large combinatorial problem. 

One important simplifying assumption 
arose from the early work of Jainin ec of. (5), 
who showed that each individual side chain 
can adopt a limited number of low-energy 
conformations (named rotamera), reducing 
the number of probable conformed available 
to a protein. This work was subsequently ex- 
tended co the design of proteins containing 
only the most- favorable roomers (6). Al- 
though the side chains in natural proteins 
deviate from ideality in a few cases (compli- 
cating the prediction of the structures of 
natural proteins), these deviations need not 
be considered in the design of idealized pro- 
teins. Thus, various algorithms have been 
developed to examine nil possible hydropho- 
bic residues in all passible rotameric states, to 
find comhiiiHtioas that efficiently fill the in- 
terior of a protein. A complementary ap- 



proach uses genetic methods to exhaustively 
search for sequences capable or filling w pro- 
tein con; (7), and this work Vims been adapted 
for the de novo design of proteins (S). 

Interfacial residues are also quire im- 
portant for protein stability (9, 10). They 
are often amphiphilic (for example, Lys, 
Arg, and Tyr) and their npoiar atoms can 
cap the hydrophobic core, while their po- 
lar groups engage in electrostatic and hy- 
d roge n - bond eel i n t e rac t i ons , 

Until recently, protein designers have fre- 
quently concentrated on quantifying die en- 
ergetics associated with just one of these tbrc:e 
types of interactions (3). However, de novo 
design is best app roach ed by simultaneously 
considering all of the side chains in die pro- 
tein unfortunately, a very high-order com- 
binatorial problem. For instance, the volume 
available to the interior side 
chains depends on die nature and 
conformation of the residues at 
the interfacial positions and vice 
versa* Dahiyat and Mayo assumed 
that oac.h of these three features 
had heen adequately quanti tared 
to provide a useful empirical en- 
ergy function lor protein design. 
Their program combines a num- 
ber of feaures taken from earlier 
potential functions and includes 
a penalty for exposing hydropho- 
bic groups to solvent. Another es- 
sential innovation included in 
their program is an implementa- 
tion of the Dead-fcnd Elimina- 
tion eheore m, to effi ci en 1:1 y 
search through sequence and side 
chain rotamcr space. 

Dahiyat and Mayo's target 
fold is a 2inc finger, a motif with 
a well-established history in protein struc- 
ture prediction and design. In an early, pre- 
scient paper, Berg correcrly inferred that this 
His 2 Cys* Zn-binding motif must feature a (5- 
P-a fold chat would position the ligating 
gtvnips iu a tetrahedral array around the 
bound Zn(II) {11), Favorable metal ion- 
ligand interactions togedicr with a small 
apolar core help stabilize the three-dimen- 
sional structure of this compact fold. More 
recently, hnperiali and co- workers have, de- 
signr.il a peptide that folded into this motif, 
even in the absence of metal ions (12). The 
design included a'D*amino acid to stabilize a 
type IT rum, and a Urge, rigid tricyclic side 
chain rhwt may help consolidate the hydro- 
phobic core.. This work was particularly ex- 
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cicinp because, heforc their studies, it was not 
^pected that sequences as short as 25 resi- 
dues in length could fold into stable tertiary 
structures. 

Now, Dahiyat and Mayo take these studies 
one step further through the design of a se- 
quence composed nf only natural amino acids 
, that adopts the zinc finper motif. As input to 
( their program, they introduced the coordi- 
nates of the backbone atoms from the erystHl 
structure of the second domain of the 2inc 
finger protein - Ztf268i The program then 
evaluated a total of 10* : possible side chain- 
rotamer combinations to find a sequence ca- 
pable of stabilizing this fold without a bound 
metal ion. The resulting protein sequence 
shares a small hydrophobic core with its pre- 
decessor from Zl£268. However, in the newly 
designed protein FSD-1 the core is enlarged 
through the addition of hydrophobic resi- 
dues that fill the space vacated by the re- 
moval of the metal-binding site {see the fig- 
ure). This increase in the size of the hydro- 
phobic core together with the enhancements 
in the propensity for forming the appropriate 
secondary structure provide an adequate 
driving force for folding. The designed 
miniprotein actually folds into the desired 
structure as assessed by nuclear magnetic, 
resonance spectroscopy, and the observed 
structure closely resembles the three-dimen- 
sional structure of Zif268. 

Because of its small size, the protein is 
marginally stable. A Van't Hoff analysis of 
the thermal unfolding curve gives a change 
in the enthalpy (AH v h) of approximately 
-10 kcal/mol T and indicates that the protein 
is about 90 to 95% folded at low tempera- 
tures (i3). The small value AH vH and the 
lack of strong cooperativity in the unfolding 
transition are expected for a native-like pro- 
tein of th is very smal I size ( J 4 ) . Thus, FSD- 1 
is the smallest protein known to be capable 
of folding into a unique structure without the 
thermodynamic assistance of disulfides, 
metal ions, or other sublimes. This important 
accomplishment illustrates the impressive 
ability of Dahiyat and Mayo's program to 
design highly optimized sequences, 

This new achievement caps a banner year 
for de novo protein design. Earlier, Regan (15) 
answered the challenge of changing a protein's 
tertiary structure hy altering no more than 50% 
of its sequence. And although Dahiyat and 
Mayo have demonstrated that the stabilizing 
metal -binding site is not necessary in their sys- 
tem, Caradonna, Hellinga, and co-workers 

(16) have made impressive progress in auto- 
mating the introduction of functional metal- 
binding sites into the three-dimensional struc- 
tures of natural proteins. Further, other workers 

(17) have used less automated approaches co 
successfully introduce functionally and spec 
troscopically Interesting metal-binding sites 
into de novo designed proteins* 



To date, the most computationally inten- 
sive protein design problems have been the 
redesign of natural proteins of known three- 
dimensional structure But the new automated 
approaches open the door to the de novo design 
of structures with entirely novel backbone con- 
formations. It will be interesting to sec if 
Dahiyat and Mayo's approach of designing an 
optimal sequence for a given fold is sufficient 
or if it will be necessary also to destabilize alter* 
nate possible folds. Indeed, when using an ear- 
lier version of their algorithm to repack the 
interior of the coiled coil from GCN4, they had 
to retain the identity of a buried Asn residue 
from ihc wild-type protein. Although the in- 
clusion of this Asn actually destabilized the 
desired fold, it was nevertheless essential to 
avoid the formation of alternate, unwanted 
conformers (18), The ability to ask such fo- 
cused questions will reveal much about how 
natural proteins adopt their folded conforma- 
tions while simultaneously allowing the design 
nf entirely new polymers for applications rang- 
ing from catalysis to pharmaceuticals. 
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